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We present a general approach to explain the Zipf's law of city distribution. If the simplest 
interaction (pairwise) is assumed, individuals tend to form cities in agreement with the well-known 
statistics 
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Zipf [|J , half a century ago, has found that the city sizes 
obey an astonishingly simple distribution law, which is 
attributed to the more generic least effort principle of hu- 
man behavior. Let us denote by q m the number of cities 
having the population size to, then R(m) = J q m i dm! 
defines the rank, i.e. the largest city has R = 1, the 
second largest R = 2, etc. Zipf found that empirically, 
R{m) ~ 1/to 7 , with 7 « 1 (see figure [l]). Remarkably 
Zipf showed that the scaling exponent 7 = 1 is very close 
to reality for many different societies and during various 
time periods. More recent data shows some varia- 
tions from the pure 7 = 1 result as also shown in hgure 
[jj. Countries which have a peculiar social structure, such 
as the former USSR or China, do not follow Zipf law. 
For other developed countries Zipf law remains a rather 
good approximation. Such a generic law calls for generic 
explanation, since different countries (e.g. Germany and 
USA) have different cultural, economic structures, and 
their people have innumerable reasons to choose whether 
to live in a big or small city. Yet collectively the society 
self-organises, without express wishes of authorities, to 
obey the Zipf's law. 

Human settlements on the earth's surface appear to 
be clustered, hence cities. This is because individuals 
interact with each other through social, economic and 
cultural ties. In very primitive times an individual (or 
a family) should perform all basic activities to survive; 
there was no need to form a large cluster beyond the 
size of a tribe. In modern times, ever refined mutual 
cooperation/competition brings people to live together. 
Yet this tendency does not seem to lead to a single mega- 
city in the world: many of us prefer to live in a big city, 
equally many may hate and escape it for all its negative 
impacts. Somehow the ensuing compromise results in a 
robust statistical distribution, the Zipf's law. 

All these very general features call for a equally general 
approach to model city distribution. Below we propose 
a general framework using master equations. Let there 
be Q cities and m, citizens in the i th city. The model is 
defined in terms of a master equation, assigning transi- 
tion rates for the growth w a {m,i) or decrease uidimf) of 
the population of a city of size to^. In other words we 
assume that with a probability w a {rni)dt a new citizen 
arrives in city i in the time interval (t,t + dt), so that 
mi — ► mi + 1. With a probability Wd{mi)dt one of the 




FIG. 1. Zipf plot for cities of population larger than 10 5 
in the US and in India, and for a simulation of the master 
equation with p = 10" 3 and W = 10 6 , a = 2. 



to, ; citizens departs so that to, — ► m t — 1. We also assume 
that there is a small probability pdt that a new city is 
created, with a single citizen. Assigning the transition 
rates w a (m) and Wd(m) specifies the model. Note that 
birth and death are not explicitly considered, these can 
be included in the transition probabilities. Note also that 
once an individual leaves a city, not necessarily he/she 
will settle in another city right away, thus the total num- 
ber of city dwellers is not conserved. Thus the whole 
system is composed of the city dwellers and a reservoir 
of unsettled travellers whose number is unregistered. 

One can study this problem introducing the average 
number q m> t of cities of size m at time t, which satisfies 
the master equation: 

d t q m ,t = w d (m+ l)q m +i,t - w d (m)q m .t + p5 m ,i 

+w a {m - l)qw_i, t - w a (m)q m . t . (1) 

The parameters of the equation are the transition rates 
w d (m), p, w a (m). 

The total number n(t) = ^ m mq m ^ t of persons and the 
total number Q of cities are generally not constant: 



9tQ = P ~ w d (l)qi,t 

OO 

d t n=p - [u)d (to) - w a (m)]q m ,t- 

m— 1 
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Let us focus on the stationary state solutions dtq m ,t — 
0, for which q m< t is simply denoted q m , independent of 
time, and n and Q are constant on average. This leads 
us to the equation 

w d (m + l)q m+ i - w d (m)q m + p5 m ,i 

+w a (m - l)q m -i - w a (m)q m = 0, (4) 

where the rate w are also independent of time. The prob- 
lem can be readily handled with the aid of the generating 
function g(s) = J2 m s m q m - 

Let us first consider the linear case Wd(m) = Dm and 
w a (Tn) = Am. This corresponds to independent deci- 
sions by the individuals. It is convenient to choose the 
constants to be A = (1 — p)/n, D — 1/n, where n counts 
the average total number of citizens (constant). In gen- 
eral there is a deficit for each existing city Wd > w a , since 
a departing individual have finite chance to create a new 
city. The above equation is readily solved with the result: 
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(5) 



One can verify that the average number of citizens is 
indeed ^ mq m = n while the average number of cities is: 



Q = 



np\ lnp| 
I-P ' 
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Several remarks are in order: due to the above deficit, the 
odds are always against the existing cities. If the deficit 
is small (or p< 1), fluctuations can still make a large 
city arise. The parameter p sets the cutoff size m* ~ 1/p 
of the power law behavior q m ~ 1/m. The condition 
p -C 1 is equivalent to the statement that the the above 
deficit is small, or that the creation of a new city is a rare 
event, which seems realistic. The distribution q m ~ 1/m 
is very broad. Inverting the rank function R(m), we find 
m(R) = m* exp(— very different from the observed 
Zipf's law. 

The linear model implies no interaction among citi- 
zens. From an individual's viewpoint, the chance to leave 
(or arrive at) a city is independent of the city's size m, 
everybody is free to move around. The simplest interac- 
tion we may consider is pair-wise type, this leads us to 
w ~ m 2 . In this case we choose w a (m) = (1 — p)m 2 /W 
and Wd{m) — m 2 /W. The calculation is, again, standard 
and the result is that 
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Note that q m ~ 1/m 2 , for m <C m* ~ 1/p. Again us- 
ing the rank relation, we find R(m) ~ 1/m, or in the 



more familar form of Zipf m(R) ~ 1/R. Therefore we 
may draw the conclusion that it is the pairwise interac- 
tion that is behind the Zipf law for city distribution, a 
general explanation indeed. Recently Zanette and Man- 
rubia have proposed a city formation model 0. They 
use a multiplication and diffusion process and find that 
their results also reproduce the Zipf's law. A possible 
motivation for such a multiplicative process is that citi- 
zens of the same city are subject to the same aggregate 
shocks, which tend to increase or decrease the population 
size. The implicit assumption is that the strength of such 
random shocks does not depend on the size rrii of city i. 
Our present approach is complementary to theirs: since 
there are more or less m 2 departures and arrivals, the net 
increase (or decrease) 5m of the population of a city in 
a unit time interval, is linearly proportional to m. This 
leads to a system with multiplicative noise. This argu- 
ment shows, on one side that multiplicative noise results 
from a pairwise interaction, on the other that aggregate 
shocks, i.e. random events which affect equally each cit- 
izen of a given city, also would lead to the m 2 transition 
rates in our model. 

On the other hand, the linear case lOj ~ m is char- 
acterized by fluctuations 8m which are proportional to 
y/m. As it has been discussed in another reproduction- 
diffusion system B, this is typical of a system of non- 
interacting individuals. 

These results are confirmed by numerical simulations. 
We show in fig. [j] the Zipf plot of a population of cities 
obtained for w a (m), Wd{m) ~ m 2 . This compares well 
with actual data H. 

One may argue that, in reality, both linear and square 
terms should be present, since an individual's decision 
must have both his independent as well as interactive 
parts. Therefore it is natural to consider the mixed case. 
For the ease of presentation we assume the transition 
probabilities to be: 

2 9 

w a (m) = \-m, w d {m) = e 1/m ( ham). (8) 

m m 

The simplest way to derive the distribution of city sizes, 
in this case, is to use detailed balance: The number of 
cities of size m becoming of size m — 1 is q m ,tWd(m). In 
the steady state, this has to balance the number of cities 
of size m—l becoming of size m, i.e. q m -i t tw a (m — 1). 
This readily gives 
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e -m/rn' r ( m + mQ ) 

m r(m + 1 + omo) 



where the constant C depends on the rate p at which new 
cities are created. 

For sizes m <C mo the distribution is practically the 
same as that for the linear case, q m ~ 1/m. If the cut-off 
size m* <C mo, we conclude that the quadratic term is 
not relevant. If, on the other hand m* 3> mo, the linear 
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behavior q m ~ 1/m holds for m <C Too and then it crosses 
over to a power law behavior 



q m ~ to 



-2-(o-l)mo for mQ < m < m * 



The Zipf exponent is therefore 7 = 1/[1 + (a — 1)to ]. 
Zipf's law 7 ~ 1, obtains only if the pairwise interac- 
tion is dominant, i.e. more precisely if |too(o — 1)| <C 1. 
In the original Zipf's work [Q, the scaling law is valid 
only for large cities. The above results with \a — 1| <C 1, 
allow for a scenario where the distribution of city sizes 
crosses over from Zipf's law with 7 = 1 for large cities 
m mo, to a non- interacting situation q m ~ 1/m for 
small towns m <C mo. The population dynamics in small 
towns is dominated by the linear term in the transition 
rates, which describes non-interacting individuals. In 
large cities, on the other hand, interaction dominates and 
it leads to Zipf's law. Note that Zipf's law, with a general 
value of 7 can also occur if the condition |mo(a— 1)| -C 1 
is not met. 

It is also interesting to consider a general model with 
Wa{m) = (1 — p)m a /W and Wd(m) = m a /W. This in- 
deed allows us to investigate the effects of multi-person 
interactions. For example a = 3 would represent a three 
person interaction. Having found that for a = 1,2 the 
solution is: 
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we can try this solution in the equation. Treating sep- 
arately the cases m = 1 and to > 1 we see indeed that 
this is the solution. Such a solution is also readily found 
using detailed balance. The numbers n and Q are given 
by 
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If a > 2 the integral is finite as p — > and one has simply 
n ~ pW. For 1 < a < 2 the integral diverges as p — ► 
and the leading term is n ~ p a ~ 1 W . In the same way 
one can calculate the number of cities 
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For a > 1 the integral is finite for p = 0, so that Q ~ pW . 

The interesting point here is that the average size of 
cities is finite for a > 2, n/Q ~ ((a~ l)/£(a), where ((x) 
is Riemann's Zeta function. This observation raises the 
question of what happens when the population density 
p = n/Q grows beyond the value p c = ((a — l)/£(a). 

The answer is that the excess population concentrates 
in just one city, which therefore has a finite, large fraction 
of the total population. This effect has been studied re- 
cently Q in an equilibrium model where an Hamiltonian 



H = J^lnmi was considered. The equilibrium distri- 
bution at inverse temperature (3 — a is clearly given by 
a power law distribution. However for a > 2, there are 
two phases: a fluid phase for p < p c {a) which is well de- 
scribed by a power law distribution with an exponential 
cutoff, and a condensed phase where a "mega-city" nucle- 
ates, containing a finite fraction of the total population. 
It is easy to understand this transition from equilibrium 
considerations: Let us consider the state in which we as- 
sign to each city i a random number of citizens m t drawn 
from a power law distribution with exponent a > 2. This 
state minimizes the entropy and it has a density which is 
given by p = n/Q = (to,) = p c (ct). If we want to find a 
state with a lower density, one can introduce a chemical 
potential, which is equivalent to an exponential cutoff on 
the distribution of m^. On the other hand if one wants to 
build a state of higher density p > p c one gets into trou- 
bles. There are two ways out: The first is to put some 
Srrii = p — p c extra citizens in each city. This results in 
a state which has a free energy cost SF ~ Q\n(p/p c ). 
The second way out is to put all the Q(p — p c ) excess 
citizens in only one city. This leads to a free energy cost 
SF ~ \n[Q(p/p c — 1)]. This only grows logarithmically 
with the system size whereas the first variant leads to an 
extensive increase of the free energy. It is then clear that 
in the equilibrium state the system prefers to create a 
"mega-city" to accommodate the excess population. 

This discussion clearly refers to an equilibrium model. 
Metropolis or Montecarlo dynamics of this equilibrium 
system || is different from the dynamics of our master 
equation. Furthermore, and more importantly, we deal 
with a system in which nor the number of cities Q nor 
the population size n are fixed. In other words the den- 
sity p is not fixed. For p« 1 the average density is very 
close to Pc{ct). The density, no matter how it fluctuates, 
will sweep across p c in time. This suggests that, in our 
model, dynamic nucleation of a "mega-city" should occur 
for a > 2. This agrees indeed with the results of numer- 
ical simulations. We show in fig.|] the distribution of the 
fraction m ma x/^ of citizens which live in the biggest city. 
For a — 3, the distribution is peaked at values very close 
to 1 (for a = 2 the critical density diverges p c = 00. Some 
precursor effects of the transition are however visible) . 

This leads to our second observation, that processes 
with a > 2 must result in the nucleation of a "mega- 
city" , which is not very realistic. We can conclude that 
interactions of higher orders (than pairwise) are not rel- 
evant in the dynamics of city formation. 

We see that the interaction leading to the Zipf's law is, 
on one hand, the simplest possible (pairwise interaction). 
On the other it is a rather special one, since it is the 
"lowest order" of interaction which does not lead to the 
formation of a mega-city, which draws a good portion 
of the whole population. The absence of a mega-city 
suggests that in an expansion of the transition rates in 
powers of m, we should neglect terms of order higher 
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FIG. 2. Distribution of the fraction of citizens in the largest 
city. 



than the second. This leads us to the mixed interaction 
case, which gives very realistic results. 

In many disparate societies, it is not unnatural to as- 
sume that individuals make their city-dwelling decision 
based on their own opinions as well as on its interaction 
with other citizens. If this indeed is the case, we show 
that the larger cities obey approximately the Zipf 's law. 
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